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I .   INTRODUCTION 


In  previous  works  on  job  scheduling,  a  multiprocessor  system 
was  modeled  as  one  containing  functionally  identical  processors. 
A  task  in  a  given  set  of  tasks  can  be  assigned  to  any  one  of  the 
processors  in  the  system.   The  time  required  to  complete  a  task  may  depend 
only  on  the  execution  time  of  the  task  in  the  case  where  all  processors 
are  identical  or  may  also  depend  on  the  speed  of  the  processor  on  which 
the  task  is  being  processed  in  the  case  where  processors  have  different 
speeds.   For  such  multiprocessor  systems,  efficient  algorithms  which 
yield  schedules  with  minimum  completion  time  or  mean  flow  times  have  been 
found  for  some  special  cases  [1-5].   Worst  case  performance  bounds  of  many 
heuristic  suboptimal  scheduling  algorithms  have  also  been  obtained  [6-8]. 

In  this  report,  we  introduce  two  general  models  of  multiprocessor 
systems  containing  different  types  of  processors.   A  task  can  be  assigned 
only  to  a  processor  of  certain  types.    We  described  in  Section  II 
a  model  corresponding  to  systems  in  which  processors  are  functionally 
dedicated  [91.   That  is,  a  task  can  be  executed  only  on  a  particular  type 
of  processors.   (For  example,  some  processors  are  front  end  processors  for 
I/O  functions,  some  processors  are  to  perform  program  compilation,  some 
processors  are  designed  specially  for  merging/sorting  operations,  and  other 
are  designed  for  extensive  numerical  calculations.)   Clearly,  the  model  of 
multiprocessor  systems  with  identical  processors  is  a  special  case  of  our 
model.   This  model  also  includes  the  job  shop  problem  in  which  there  is 
exactly  one  processor  of  each  type. 

In  Section  III,  a  more  general  model  is  presented.   In  this 
general  model,  each  type  of  functionally  dedicated  processors  is  further 


I 


divided  into  subtypes  with  a  partial  ordering  relation  defined  over  the 
processor  subtypes.   Thus,  multiprocessor  systems  in  which  some  pro- 
cessors are  functionally  identical  but  have  dedicated  memories  of  differ- 
ent sizes  can  be  modeled. 
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II.   MULTIPROCESSOR  SYSTEMS  WITH  FUNCTIONALLY  DEDICATED  PROCESSORS 


Consider  a  model  of  a  multiprocessor  system  in  which  there 
are  r  different  types  of  functionally  dedicated  processors.   We  assume 
in  this  section  that  processors  of  the  same  type  are  identical.   We  shall 
refer  to  these  r  types  of  processors  as  type  1,  type  2,  ...  ,  type  r  pro- 
cessors.  A  multiprocessor  system  with  m  type  1  processors,  m0  type  2 

processors,  ...  ,  and  m   type  r  processors  can  be  represented  by  the 

r 

ordered  r-tuple  P  =  (m, ,  m„ ,  ...  ,  m  ).   We  let  m  =  2  m.. 

12         r  .  ,   j 

3=1 

Let  T  =  {Tn ,  T.,  ...  ,  T  }  be  a  set  of  tasks  to  be  executed  on 
12         n 

a  system  P.   A  task  T.  is  said  to  be  a  type  j  task  if  it  can  only  be 

executed  on  a  type  j  processor.   Formally,  we  define  a  function  A  from 

T  to  the  set  {1,  2,  ...  ,  r}  such  that  A(T.)  is  the  type  of  task  T.. 

We  denote  the  time  required  to  complete  a  task  T.  (on  a  type  j  processor) 

by  u(T.)  where  u  is  a  function  from  T  to  the  reals.   u(T.)  shall  be  referred 

to  as  the  execution  time  of  T.. 

l 

We  suppose  that  there  is  a  precedence  relation  <  defined  over 

the  set  T.   That  T.  precedes  T.  (or  T.  follows  T.)  is  written  as  T.  <  T. 
i  3  1  i  i    J 

and  means  that  the  execution  of  T.  cannot  begin  before  the  execution  of  T. 

J  i 

is  completed.   A  task  is  said  to  be  executable  at  a  certain  time  if  the 
tasks  preceding  it  have  been  completed.   Formally,  a  set  of  tasks  are 
represented  by  ordered  quadruple  (T,  A,  u,  <) .   We  shall  also  use  the 
notation  (T.,  A(T.),  u(T.))  to  represent  to  a  particular  task  T. . 

We  can  depict  a  set  of  Tasks  (T,  A,  y,  <)  graphically  as  il- 
lustrated by  the  example  in  Figure  1,  where  T  ,  T~,  and  T,.  are  type  1 
tasks  with  execution  time  3,  4  and  2,  respectively.   T„  and  T,  are  type  2 
tasks  with  execution  time  4  and  2,  respectively. 

We  want  to  determine  the  performance  of  a  class  of  scheduling 


algorithms  in  which  processors  are  never  left  idle  intentionally.   These 
algorithms  are  known  as  priority  driven  scheduling  algorithms  and  can  be 
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(Tr  1,  3) 


(T2,  1,  4) 


(T4,  2,  2) 


(T3,  2,  4) 


(T5,  1,  2) 


Figure  1.   An  example  of  a  Set  of  Tasks  (T,  A,  u,  <) . 


described  by  the  priorities  assigned  to  the  tasks.   At  any  moment  when  a 
type  i  processor  is  free,  the  task  that  has  the  highest  priority  among  all 
executable  type  i  tasks  is  scheduled.   For  example,  for  the  set  of  tasks  in 
Figure  2(a),  the  schedule  in  Figure  2  (b)  is  obtained  according  to  the 
priority  list  (T.  ,  T0,  T„,  T.  ,  Tc,  T , ,  T..,  T0,  Tn)  in  which  tasks  appear 
in  decreasing  priorities.   Schedules  produced  by  priority  driven  scheduling 
algorithms  are  known  as  priority  driven  schedules. 

Let  a)  and  to'  denote  the  completion  times  of  a  set  of  tasks 
(T,  A,  u,  <)  when  executed  on  a  system  P  =  (m  ,  m  ,  ...  ,  m  )  according  to 
a  priority-driven  schedule  and  an  arbitrary  schedule,  respectively.   We 
have 
Theorem  1 


-  ,<  1  +  r 

or — 


min/  1  \ 


(1) 


Moreover,  the  bound  is  the  best  possible. 

Proof :   Let  t.  denote  the  sum  of  the  execution  times  of  all  type  j  tasks 

in  (T,  A,  u,  <) .   (For  example,  for  the  set  of  tasks  in  Figure  2,  t,  =  5 -j 

and  t„  =  3-^-).   Let  (b(nn,  n„ ,  ...  ,  n  )  denote  the  sum  of  idle  times  in 
2     2  1   2         r 

all  processors  during  which  there  are  n  idle  processors  of  type  1,  n~ 
idle  processors  of  type  2,  ...  ,  and  n  idle  processors  of  type  r  in  a 
priority-driven  schedule.   (In  the  example  shown  in  Figure  2,  4>(1,0)  =1, 
((.(2,0)  =|  +  |=1,  (j>(l,l)  =2  +  2  =  4,  <K0,0)  =  0,  4>(0,1)  =  0,  and  <j>(2,l) 
=  0.)  Clearly,  the  total  idle  time,  $,  is  given  by 


m„ 


m. 


m 


=  2      2 
n^O  n2=0 


2   4)(n1,  n2,  ...  ,  nr) 

n  =0 
r 


(2) 


with  (fi  (m  ,  m  ,  ...  ,  m  )  =  0.    The  completion  time  of  the  priority- 


Note  that  <f>(m  ,  m  . 


m  )  =  0  in  a  priority-driven  schedule, 


V. 


(T^  1,1) 


(T2,  1.1) 


(T3,  2,1) 


(T6,l,f) 

(T?,l,|) 
(TQ,2,1) 


(T9,2,f) 


(T4,  1,1) 


(T5,l,l) 


:; 
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////////A 
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T6 

T7 

V/////////////A 

3 

2 

3 
2 

Y/////////////A 

Type  2 


Figure  2.   Example  of  a  priority  driven 
schedule  on  a  multiprocessor  system  con- 
taining different  types  of  processors. 


driven  schedule  to  is  given  by 


a)  =  -  (  2  t.+  *) 
m  j=l  * 


(3) 


To  find  an  upper  bound  to  the  total  idle  time,  $,  let  us 
examine  the  terms 


m, 


m. 


I  =    s 


n  =1  n  =1 


m 


n  =1 
r 


((>(n1,  n2, 


•  n  ) 
r 


in  the  sum  in  Eq .  (2) .   During  the  time  periods  corresponding  to  the 
terms  in  the  sum  I,  there  is  at  least  one  idle  processor  among  the  pro- 
cessors of  each  type.   Therefore,  there  is  a  chain  of  tasks  in  T  that  is 
executed  sequentially  during  these  idle  periods,  and  must  be  executed 
sequentially  in  any  schedule.   Thus,  we  conclude  that 

I  £  (m-l)oj'  (4) 

since  no  schedule  can  have  a  completion  time  oj'  less  than  the  length  of 
this  chain. 


Let  K.  denote  the  sum  of  lengths  of  idle  periods  during  which 


all  type  j  processors  are  busy.   We  have 


ra-m. 

K.  <  '- 

J  -  m 


I 


m.-n . 
_J 1 


l<n.<m.-l 

-  J-  J 

0<n  <m   for  u^i 

—  u—  u 


n1+n„+.  .  .+n 
12      r 


(n..  ,n~,  .  .  .  ,n^_) 


m-m .      m-m . 

1  t 1 

m    j    m 


< 


I 


m_l   l<n.<m.  , 

-  J-  1-1 

0<n  <m  for  u^i 

—  u—  u 


(n1,n2, . . . ,nr) 


Furthermore, 


2  K.  < 


J-l 


r  m-m. 
J-l 


m , 


t .  - 


r  m-m. 
S 

j-l 


m , 


1 

m-1 


I 


l<n.<m.-l 
~  J-  J 

0<n  <m  for  u^i 
—  u—  u       J 


(n  ,n2, 


•V 
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I 
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'1 


13 
13 
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:c 
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r  m-m.              .    m-m. 
v     -\                 I      rain     i 

—  .  n   m.    i   m-l    l<j<r   m. 
J=l   J                    J 

(5) 

Since 

r 

$  <  I  +  2  K. 

we  have  from  Eq.  (5), 

r  m-m. 

$  <  2  1  t.  +  I 

—  .  -   m.   j 
J=l   J 

i      .   m-m . ^ 
,     1    mm 

m-l   l<j<r  mj 

• 

Combining  this  expression  with  the  inequality  in  Eq.  (4)  and  that 

t . 

-J-  <  a)'      j  =  1,  2,  ...  ,  r. 
ra.  — 
3 

We  have 

,       /     min   i  \ 
$  <  m(r-l)to'  +  raw'  1  1  -  1<j<r  —  1 

(6) 

Furthermore,  since 

i   r 

-  .2,  t.  <  a)'. 

(7) 

We  have  the  inequality  Eq.(l)  by  substituting  Eqs.  (6)  and  (7)  into 

Eq.  (3). 

The  example  shown  in  Figure  3  demonstrates  that  the  upper 

bound 

in  Eq.  (1)  is  best  possible.   In  this  example,  we  have 

„  _  max 
ra,  -        m. 
1   l<j<r   j 

There  are  ra  (m  -1)  +  2  type  1  tasks  which  are  denoted  as  T^,  T^*  . 

•  •  > 

T   ,  ,        , N  „^  .   Their  execution  times  are 
1, (m1(m1-l)+2) 

M(TU)  -  e 


u(T..)=-    i  =  2,  3,...  ,m  (m  -1)  +  1 
li     mn  J-   -1- 


There  are  m.  +  1  type  j  tasks  for  j  =  2,  3,  . . .  ,  r  which  we  shall  denote  as 


V  V 


•  '  Tj,(.yi)  for  J  =  2-  3' 


and 


r.   Their  execution  times  are 


u(Tj£)  =1     j  =  2,  3,  ...  , 


I   =  1,  2,  ...  ,  m. 


y(Tj,(m.+l))=£   J  =  2,  3,  ...  ,  r 
J 

Furthermore,  the  precedence  relation  is  as  shown  in  Figure  3a.   The 
priority  schedule  in  Figure  3b  is  obtained  according  to  the  priority  list 


T12'  T13'  '••  '  TU(m1(m1-l)+l)»  Tll»  T21'  T22  "•  T2,(m2+1)  '  T31'  T32' 


Tr,(mr+1)'  Tl,(m1(m1-l)+2)/ 


It's  complete  time  is 


m  -1 
a)  =  r  -\ 1-  re 


m„ 


while  the  completion  time  of  the  schedule  shown  in  Figure  3c  is 

u)1  =  1  +  re 
Hence,  the  bound  in  Eq.  (1)  is  achieved  for  very  small  e 

When  the  processors  in  the  system  are  all  identical,  r 
The  result  in  Theorem  1  reduces  to  the  well-known  result  [6]: 
Corollary  1:   For  a  system  containing  m  identical  processors 

^<  2-i 

0)'  —     m 

On  the  other  hand,  for  a  job  shop  problem,  m..  =  m_  =  .  .  .  =  m  =  1 

In  this  case,  we  have, 

Corollary  2:   For  a  r-processor  job  shop 
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III.   GENERALIZATION 

The  model  of  a  multiprocessor  system  in  which  each  processor 
has  a  dedicated  memory  of  a  certain  fixed  capacity  was  studied  in  [8]. 
In  this  case,  the  execution  of  a  task  requires  a  certain  amount  of  memory 
space  and  can  only  take  place  on  a  processor  whose  memory  capacity  is  larger 
than  or  equal  to  its  memory  requirement.   Our  model  in  Section  II  can  be  ex- 
tended such  that  every  processor  is  characterized  by  its  type  as  well  as 
by  its  memory  capacity.   We  shall,  however,  present  a  more  general  model. 

Consider  a  multiprocessor  system  consisting  of  processors  of  r 
different  types.   Each  type  of  processors  is  further  divided  into  sub- 
types.  Thus,  we  shall  refer  to  a  processor  as  a  type  (j,k)  processor  when 
it  is  a  type  j  processor  of  subtype  k.   A  partial  ordering  relation  <  is 
defined  over  the  processor  types  such  that 

(i)   (j,k)  and  (n,v)  are  incomparable   if  j  ^  n. 
(ii)   (j,k)  <  (j,v)  means  that  if  a  task  can  be  executed  on  a 
tyPe  (j>k)  processor  then  it  can  also  be  executed  on  a  type  (j ,v)  processor. 
A  multiprocessor  system  can  then  be  represented  as  P  =  (m   ,  m  _,..., m   ; 

^,)  nin,,..Mm0(]  » . .  .  m  _ .  m  _.  ...  ,  m   ',  4  )   where  m#1  is  the  number  of 


'21'  "22"'"m2V-  mrl'  mr2' 


rl 


jk 


tyPe  (j>k)  processors  and  ^  is  a  partial  ordering  relation  over  the 
processor  types.   We  shall  let 


m 


=  2 


m. 


k=l   Jk 


m 


=  2 
j=l 


m. 


For  example,  the  multiprocessor  system  represented  by  the  directed  graph 
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4. 

Neither  (j,k)  <  (u,v)  nor  (u,v)  <  (j ,k) . 

■L 

There  is  an  edge  from  (j,k)  to  (j  ,v)  means  (j,v)  -<  (j,k). 
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in  Figure  4  contains  three  types  of  processors.   There  are  three  type  1  pro- 
cessors of  the  same  subtype.   The  3  processors  of  type  2  having  dedicated 
memories  of  different  sizes  are  linearly  ordered  as  shown.   For  the  4  pro- 
cessors of  type  3,  neither  (3.1)  <  (3,2)  nor  (3,2)  <  (3.1)  but  (3,3)  <  (3,1) 
and  (3,3)  •<  (3,2).   This  system  can  be  represented  as  P=(3;l,l,l;  2, 1,1,  ). 

Let  T  =  {T..  ,  T„,  ...  ,  T  }  be  a  set  of  tasks  to  be  exectued  on 
1   £.  n 

a  system  P.   A  task  T.  is  said  to  be  a  type  (j,k)  task  if  it  can  be  executed 
on  any  type  (j,v)  processor  for  (j,k)  -K.    (j,v)  and  on  no  other  type  of  pro- 
cessors.  Thus,  a  set  of  tasks  can  be  specified  by  an  ordered  4-tuple 
(T,  A,  y,  <)  where  A  is  a  function  from  T.  to  the  processor  types  so  that 
A(T.)  specifies  the  type  of  T.,  and  y  and  <  are  as  defined  in  Section  II. 

Again,  we  use  the  notation  (T.,  A(T.),  y (T . )  )  to  represent  a  particular  task  T., 

Consider  all  type  (j,k)  processors  for  a  fixed  j.   The  smallest  sub- 
set of  types  {(j,k  ),  (j,k2),  ...  (j,k  )}  is  said  to  be  the  dominating  set  if 
for  any  type  (j,k),  (j,k)  -<  (j,k  )  for  some  1  <_  p  <_  q.   In  other  words,  any 
tyPe  (j»k)  task  can  be  executed  on  a  processor  whose  type  belongs  to  the 
dominating  set.   For  example,  in  the  multiprocessor  system  shown  in  Figure  4, 
{(1,1)},  {(2,1)}  and  {(3,1),  (3,2)}  are  dominating  sets.  We  also  refer  to  a 
type  (j,k)  as  a  maximal  type  if  it  is  in  the  dominating  set. 
For  a  given  dominating  set,  let 

m._  =  min{m..  |(i,k  )  is  in  the  dominating  set} 
jO        jk  '    p 

In  other  words,  m.n  is  the  minimum  of  the  numbers  of  processors  among  all 
types  of  processors  in  the  dominating  set.   Therefore,  m#n  is  equal  to  3,  1, 
and  1  for  j  =  1,  2,  and  3,  respectively,  in  our  example.   The  result  in 
Theorem  1  can  be  generalized  to 
Theorem  2 


(0 


m 


,  <  1  +  E 
a)'  —    .  ,  m 


J.  _ 


mm 


l<j<r  m.„ 

j-i  -jo   --  JO 


(8) 


& 


13 

i; 

!» 
Q 

:: 

! 


16 


Proof.   Again,  let  t.  denote  the  total  execution  times  of  all  type  (j,k) 
tasks  for  k  =  1,  2,  ...  ,  £. .   For  a  priority-driven  schedule,  let 


<Knn,  n12""»nul'  n21' 


,  ...  ,  n0„  ,  ...  ,  n   ,  ...  ,  nf£  )  denote  the 
2  r 


sum  of  idle  times  in  all  processors  in  P  during  which  there  are  n    idle 
processors  of  type  (j,k)  for  j  =  1,  2,  ...  r  and  k  =  1,  2,  ...  ,  I . .   And 

the  total  idle  time 


11 


'12 
Z 


m 


nu-0n12-0 


r£ 


n  n  =0 
r 


*(nn,  n12, 


r£ 

r 


(9) 


Hence  the  completion  time  of  the  priority  driven  schedule  is 
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oj  =  — 
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J  =  l 


as  given  by  Eq.  (3). 

Let  I  be  the  sum  of  all  terms  in  Eq.  (9)  corresponding  to 
idle  periods  during  which  at  least  one  of  each  maximal  type  processor  is 
idle.   That  is 
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where  S   =  {1,  2,  ...  ,  m   -  1}  if  (j,k)  is  a  maximal  type  and,  S  ,  = 

J  k  J  K  J  K- 

{0,  1,  2,  ...  ,  m   }  if  (j,k)  is  not  a  maximal  type, 
jk 

Again,  there  is  a  chain  of  task  in  T  such  that  during  these 
idle  periods  one  of  the  other  processors  is  executing  a  task  in  the  chain, 

Hence 

I  <  (ra-lW  (1Q) 

Let  K  denote  the  sum  of  lengths  of  idle  periods  during  which  all  the 

j 
processors  of  one  or  more  maximal  types  of  the  form  (j,*)  are  busy.   We 
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Since 
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and 
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as  well   as    Eq.    (10) ,    we   obtain 
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which  is  the  bound  given  by  Eq.  (8).   When  the  dominating  sets  contains 
only  one  subtype  for  all  j  -  1,  2,  ....  r,  (that  is,  there  is  an  unique 

maximal  subtype  for  all  types),  we  have 

Corollary  3:   The  bound  given  by  (8)  is  the  best  possible. 

That  this  bound  is  best  is  illustrated  by  the  example  in  Figure  5, 
When  the  multiprocessor  system  contains  only  one  type  of  pro- 
cessors, we  have  m.  =  0  for  i=2,  3,  ...  ,  r.   The  bound  given  by  Theorem 


2  is  simply 
Corollary  4: 


a)        mi 

771  1  + 
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m. 
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The  bound  derived  in  [ 8]  for  processors  with  different  storage  capacities 
is  a  special  case  of  our  result  with  m   =1. 
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Figure  5b.   Worst  Case  Priority-Driven  Schedule 
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Figure  5c.   Optimal  Schedule 
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